Categorizing ancient documents

نویسندگان

  • Nizar Zaghden
  • Rémy Mullot
  • Adel M. Alimi
چکیده

The analysis of historical documents is still a topical issue given the importance of information that can be extracted and also the importance given by the institutions to preserve their heritage. The main idea in order to characterize the content of the images of ancient documents after attempting to clean the image is segmented blocks texts from the same image and tries to find similar blocks in either the same image or the entire image database. Most approaches of offline handwriting recognition proceed by segmenting words into smaller pieces (usually characters) which are recognized separately. Recognition of a word then requires the recognition of all characters (OCR) that compose it. Our work focuses mainly on the characterization of classes in images of old documents. We use Som toolbox for finding classes in documents. We applied also fractal dimensions and points of interest to categorize and match ancient documents.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identifying and Categorizing the Dimensions of Iran's Health System Response to the Covid-19 Pandemic

Background and Aim: Coinciding with the onset of Covid-19, known as Corona in Iran, there have been many scattered reactions from the Iranian health system to the management of the disease. The aim of this study is to identify and categorize the dimensions of the Iranian health system response in order to identify points that have been overlooked and ignored. The results of this study can be us...

متن کامل

Watermarking Ancient Documents Based on Wavelet Packets

The ancient documents present an important part of our individual and collective memory. In addition to their preservation, the digitization of these documents may offer users a great number of services like remote look-up and browsing rare documents. However, the documents, digitally formed, are likely to be modified or pirated. Therefore, we need to develop techniques of protecting images ste...

متن کامل

Methods for Written Ancient Music Restoration

Access to collections of cultural heritage is increasingly becoming a topic of interest for institutions like libraries. With the easy access to information provided by technologies such as the Internet, new ways exist for consulting ancient documents without exposing them to more dangers of degradation. One of those types of documents is written ancient music. These documents suffer from multi...

متن کامل

Medical Education in Ancient Persia

Introduction: Historically, education is the prominent part of medical systems of different cultures. It has been surveyed in some cultures. In this study, we tried to uncover educational practices applied and present a sketch of medical education in ancient Persia (from beginning to 637 AD) based on the available evidence. Methods: In this study, old Persian scripts and other written document...

متن کامل

Ancient Document Analysis : A Set of New Research Problems

This paper deals with the problem of ancient document analysis. The first part of the paper is dedicated to a kind of state of art concerning french community projects, the purpose of which deals with the preservation and the exploitation of heritage documents. The second part focusses on a set of open issues, which should be tackled by the document analysis community, for the management of the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1308.6311  شماره 

صفحات  -

تاریخ انتشار 2013